A “natural” Lexicalization Model for Language Generation
نویسنده
چکیده
We propose a general lexicalization model which accounts for how lexical units are selected and introduced in linguistic utterances during language generation. This model aims at “naturalness” by being based on actual lexical knowledge used in speech; consequently, it should be compatible with standard patterns of behavior shown by humans when they speak (flexibility in computing both content and form of linguistic utterances, prototypical types of mistakes and backtracking, etc.). The main advantage of our model, once implemented in automatic language generation, is that it takes into account fundamental differences that exist between lexical units, with regard to why and how they are used in texts. This is achieved by means of a stratificational approach to lexicalization, where each type of lexical unit is introduced at a proper level of representation, according to the role it plays in the enunciation. Section 1 offers a general characterization of the approach and makes explicit its main assumptions. Sections 2 to 4 successively examine the three levels of transition implied by the stratificational structuring of the model. Section 5 concludes with an examination of its relevance to the design of text generation systems.
منابع مشابه
Natural Language Generation of Museum Object Descriptions based on User Model
Natural Language Generation (NLG) techniques can be applied in generating virtual documents dynamically using information from a database (Dale et al, 1999). One of the applications of NLG techniques to generate documents dynamically is the web-based interactive virtual museum, VIGAN. NLG is used to generate the descriptions of the objects in a virtual museum dynamically based on the profile an...
متن کاملReusing MT Components in Natural Language Generation for Dialogue Systems∗
This paper describes ongoing research on NLG for dialogue systems. Sentence planning is performed by selecting the appropriate template, while previously developed transfer and generation components of a transfer–based MT architecture perform the lexicalization and linguistic realization processes in the generation process. This approach allows for multilingual generation since there is a clear...
متن کاملA Constraint-Based Model for Preposition Choice in Natural Language Generation
In this paper, we show how a constraint-based approach influences the modelling of preposition lexicalization in natural language generation. We concentrate on the linguistic description, which is the most challenging. The CSP procedures themselves are then rather straightforward. Preposition choice depends on the verb and its requirements, on the one hand, and the characteristics of the NP the...
متن کاملEnriching Answers in Question Answering Systems using Linked Data
Linked Data has emerged as the most widely used and the most powerful knowledge source for Question Answering (QA). Although Question Answering using Linked Data (QALD) fills in many gaps in the traditional QA models, the answers are still presented as factoids. This research introduces an answer presentation model for QALD by employing Natural Language Generation (NLG) to generate natural lang...
متن کاملArchitectures for Natural Language Generation: Problems and Perspectives
Current research in natural language generation is situated in a computational linguistics tradition that was founded several decades ago. We critically analyse some of the architectural assumptions underlying existing systems and point out some problems in the domains of text planning and lexicalization. Guided by the identification of major generation challenges viewed from the angles of know...
متن کامل